TextControlGAN: Text-to-Image Synthesis with Controllable Generative Adversarial Networks

نویسندگان

چکیده

Generative adversarial networks (GANs) have demonstrated remarkable potential in the realm of text-to-image synthesis. Nevertheless, conventional GANs employing conditional latent space interpolation and manifold (GAN-CLS-INT) encounter challenges generating images that accurately reflect given text descriptions. To overcome these limitations, we introduce TextControlGAN, a controllable GAN-based model specifically designed for synthesis tasks. In contrast to traditional GANs, TextControlGAN incorporates neural network structure, known as regressor, effectively learn features from texts. further enhance learning performance data augmentation techniques are employed. As result, generator within can texts more effectively, leading production closely adhere textual conditions. Furthermore, by concentrating discriminator’s training efforts on GAN exclusively, overall quality generated is significantly improved. Evaluations conducted Caltech-UCSD Birds-200 (CUB) dataset demonstrate surpasses cGAN-based GAN-INT-CLS model, achieving 17.6% improvement Inception Score (IS) 36.6% reduction Fréchet Distance (FID). supplementary experiments utilizing 128 × resolution images, exhibits ability manipulate minor bird according These findings highlight powerful tool high-quality, text-conditioned paving way future advancements field

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of generative adversarial networks for automatic text-to-image generation

This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...

متن کامل

Generative Adversarial Text to Image Synthesis

Automatic synthesis of realistic images from text would be interesting and useful, but current AI systems are still far from this goal. However, in recent years generic and powerful recurrent neural network architectures have been developed to learn discriminative text feature representations. Meanwhile, deep convolutional generative adversarial networks (GANs) have begun to generate highly com...

متن کامل

Medical Image Synthesis with Context-Aware Generative Adversarial Networks

Computed tomography (CT) is critical for various clinical applications, e.g., radiotherapy treatment planning and also PET attenuation correction. However, CT exposes radiation during acquisition, which may cause side effects to patients. Compared to CT, magnetic resonance imaging (MRI) is much safer and does not involve any radiations. Therefore, recently, researchers are greatly motivated to ...

متن کامل

StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) aimed at generating high-resolution photorealistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for textto-imag...

متن کامل

Controllable Generative Adversarial Network

Although it is recently introduced, in last few years, generative adversarial network (GAN) has been shown many promising results to generate realistic samples. However, it is hardly able to control generated samples since input variables for a generator are from a random distribution. Some attempts have been made to control generated samples from GAN, but they have shown moderate results. Furt...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13085098